Preference for 50% reinforcement over 75% reinforcement by pigeons
نویسندگان
چکیده
منابع مشابه
Preference-based Reinforcement Learning
This paper investigates the problem of policy search based on the only expert’s preferences. Whereas reinforcement learning classically relies on a reward function, or exploits the expert’s demonstrations, preference-based policy learning (PPL) iteratively builds and optimizes a policy return estimate as follows: The learning agent demonstrates a few policies, is informed of the expert’s prefer...
متن کاملRisky choice in pigeons: preference for amount variability using a token-reinforcement system.
Pigeons were given repeated choices between variable and fixed numbers of token reinforcers (stimulus lamps arrayed above the response keys), with each earned token exchangeable for food. The number of tokens provided by the fixed-amount option remained constant within blocks of sessions, but varied parametrically across phases, assuming values of 2, 4, 6, or 8 tokens per choice. The number of ...
متن کاملConcurrent drinking by pigeons on fixed-interval reinforcement schedules.
Three experienced pigeons were exposed to at least ten consecutive 100-min sessions on each of three food-reinforced fixed-interval (FI) schedules: FI 50-sec, FI 100-sec and FI 200-sec. Water was freely available. Drinking was largely confined to the first third of each fixed interval, and the mean sessional water intake was directly related to the food-reinforcement rate for each animal. The a...
متن کاملTowards Preference-Based Reinforcement Learning
This paper makes a first step toward the integration of two subfields of machine learning, namely preference learning and reinforcement learning (RL). An important motivation for a preference-based approach to reinforcement learning is the observation that in many real-world domains, numerical feedback signals are not readily available, or are defined arbitrarily in order to satisfy the needs o...
متن کاملPreference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning
This paper makes a first step toward the integration of two subfields of machine learning, namely preference learning and reinforcement learning (RL). An important motivation for a “preference-based” approach to reinforcement learning is a possible extension of the type of feedback an agent may learn from. In particular, while conventional RL methods are essentially confined to deal with numeri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Learning & Behavior
سال: 2009
ISSN: 1543-4494,1543-4508
DOI: 10.3758/lb.37.4.289